Data Streams: Models and Algorithms Data Streams: Models and Algorithms

نویسندگان

  • CHARU C. AGGARWAL
  • Charu C. Aggarwal
  • Jiawei Han
  • Jianyong Wang
  • Mohamed Medhat Gaber
  • Arkady Zaslavsky
  • Shonali Krishnaswamy
  • Dora Cai
  • Yixin Chen
  • Guozhu Dong
  • Jian Pei
  • Benjamin W. Wah
چکیده

In recent years, advances in hardware technology have facilitated new ways of collecting data continuously. In many applications such as network monitoring, the volume of such data is so large that it may be impossible to store the data on disk. Furthermore, even when the data can be stored, the volume of the incoming data may be so large that it may be impossible to process any particular record more than once. Therefore, many data mining and database operations such as classification, clustering, frequent pattern mining and indexing become significantly more challenging in this context. In many cases, the data patterns may evolve continuously, as a result of which it is necessary to design the mining algorithms effectively in order to account for changes in underlying structure of the data stream. This makes the solutions of the underlying problems even more difficult from an algorithmic and computational point of view. This book contains a number of chapters which are carefully chosen in order to discuss the broad research issues in data streams. The purpose of this chapter is to provide an overview of the organization of the stream processing and mining techniques which are covered in this book.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows

Todays, in many modern applications, we search for frequent and repeating patterns in the analyzed data sets. In this search, we look for patterns that frequently appear in data set and mark them as frequent patterns to enable users to make decisions based on these discoveries. Most algorithms presented in the context of data stream mining and frequent pattern detection, work either on uncertai...

متن کامل

مدلسازی دیفرانسیلی خشک کردن مواد دوغابی و شبیه سازی عملکرد خشک کن افشانه ای

Process control of a spray dryer that is usually used as the last step of production is very crucial in obtaining a quality standard product. To this end, predicting the effect of various operating and environmental parameters on product properties is essential. Modeling was done in microscopic and macroscopic scales by modifying the mass and heat transfer equations used in investigating the dr...

متن کامل

VMLP neural network design using optimization algorithms to predict spider suspend (Case Study: Watershed Dam Kardeh)

One of the most important processes of erosion and sediment transport in streams is the river most complex engineering  issues.this process special effects on water quality indices, action suburbs floor and destroyed much damage to the river and also into the development plans  Lack of continuity sediment sampling and measurement of many existing stations. due to the low number of hydrometric s...

متن کامل

Application of Markov-Chain Analysis and Stirred Tanks in Series Model in Mathematical Modeling of Impinging Streams Dryers

In spite of the fact that the principles of impinging stream reactors have been developed for more than half a century, the performance analysis of such devices, from the viewpoint of the mathematical modeling, has not been investigated extensively. In this study two mathematical models were proposed to describe particulate matter drying in tangential impinging stream dryers. The models were de...

متن کامل

Optimization of sediment rating curve coefficients using evolutionary algorithms and unsupervised artificial neural network

Sediment rating curve (SRC) is a conventional and a common regression model in estimating suspended sediment load (SSL) of flow discharge. However, in most cases the data log-transformation in SRC models causing a bias which underestimates SSL prediction. In this study, using the daily stream flow and suspended sediment load data from Shalman hydrometric station on Shalmanroud River, Guilan Pro...

متن کامل

Group Testing in Statistical Signal Recovery

Over the past decade, we have seen a dramatic increase in our ability to collect massive data sets. Concomitantly, our need to process, compress, store, analyize, and summarize these data sets has grown as well. Scientific, engineering, medical, and industrial applications require that we carry out these tasks efficiently and reasonably accurately. Data streams are one type or model of massive ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006